Corpus: eng-de_web_2014_10K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 98 99 99 99 99
1000 880 986 998 999 999
10000 5799 9279 9838 9945 9963
100000 5800 9280 9839 9946 9964
1000000 5800 9280 9839 9946 9964


Zipf's diagram for sentence endings


Gnuplot diagram

863 msec needed at 2018-04-13 09:54